This notebook was put together by [Jake Vanderplas](http://www.vanderplas.com). Source and license info is on [GitHub](https://github.com/jakevdp/sklearn_tutorial/).
Preliminaries: Setup & introduction (15 min)
Basic Principles of Machine Learning and the Scikit-learn Interface (45 min)
Supervised learning in-depth (1 hr)
Unsupervised learning in-depth (1 hr)
Model Validation (1 hr)
This tutorial requires the following packages:
numpy
version 1.8 or later: http://www.numpy.org/scipy
version 0.15 or later: http://www.scipy.org/matplotlib
version 1.3 or later: http://matplotlib.org/scikit-learn
version 0.15 or later: http://scikit-learn.orgipython
/jupyter
version 3.0 or later, with notebook support: http://ipython.orgseaborn
: version 0.5 or later, used mainly for plot stylingThe easiest way to get these is to use the conda environment manager. I suggest downloading and installing miniconda.
The following command will install all required packages:
$ conda install numpy scipy matplotlib scikit-learn ipython-notebook
Alternatively, you can download and install the (very large) Anaconda software distribution, found at https://store.continuum.io/.
In [ ]:
from __future__ import print_function
import IPython
print('IPython:', IPython.__version__)
import numpy
print('numpy:', numpy.__version__)
import scipy
print('scipy:', scipy.__version__)
import matplotlib
print('matplotlib:', matplotlib.__version__)
import sklearn
print('scikit-learn:', sklearn.__version__)
import seaborn
print('seaborn', seaborn.__version__)